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ABSTRACT 



A field programmable gate array (FPGA) which includes 
first and second arrays of configurable logic blocks, and first 
and second configuration cache memories coupled to the 
first and second arrays of configurable logic blocks, respec- 
tively. The first configuration cache memory array can either 
store values for reconfiguring the first array of configurable 
logic blocks, or operate as a RAM. Similarly, the second 
configuration cache array can either store values for recon- 
figuring the second array of configurable logic blocks, or 
operate as a RAM. The first configuration cache memory 
array and the second configuration cache memory array are 
independently controlled, such that partial reconfiguration of 
the FPGA can be accomplished. In addition, the second 
configuration cache memory array can store values for 
reconfiguring the first (rather than the second) array of 
configurable logic blocks, thereby providing a second-level 
reconfiguration cache memory. 

37 Claims, 14 Drawing Sheets 
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1 2 

RAPIDLY RECONFIGURABLE FPGA configurations. For example, the first configuration cache 

HAVING A MULTIPLE REGION memory array can store values corresponding to four dif- 

ARCHITECTURE WITH ferent configurations of the first array of CLBs, and the 

RECONFIGURATION CACHES USEABLE AS second configuration cache memory array can store values 

DATA RAM 5 corresponding to four different configurations of the second 

array of CLBs. 

BACKGROUND OF THE INVENTION The values stored by the first and second configuration 

1 Field of the Invention cache arrays can include sets of configuration data values for 

The present invention relates to a dynamically reconfig- 10 r^^ 8 ro ^P° ndk 8 f» nfi f^° n «^ » 

urable field programmable gate array (FPGA). "f"* ™? of configurable logic blocks, as 

. . well as sets of state data values for programming corre- 

2. Description of Related Art sponding function generators in the first and second arrays 

FPGAs typically include configurable logic blocks ofconfigurable logic blocks. The sets of sute data values can 

(CLBs) which can be reconfigured to implement various be shared among the various configurations of the arrays of 

logical functions. The configuration of these CLBs is defined CLBs, thereby providing additional flexibility in the recon- 

by the data stored in a corresponding set of configuration figuration of the FPGA. 

memory cells. These configuration memory cells have been In a particular embodiment, the values stored by the first 

programmed in several different manners. Typically, the and second configuration cache memory arrays are loaded 

configuration memory cells are sequentially programmed in into the first and second arrays of CLBs, respectively, over 
response to an externally provided stream of configuration 20 a plurality of operating cycles . By spreading the transfer of 

data values. Newer technology, such as that described in va i ues over a plurality of operating cycles, the required 

U.S. Pat. No. 5,600,263, teaches that the configuration data routiog resources between the configuration cache memory 

values for a particular configuration can be stored in a arrays and the arrays of CLBs are greatly reduced. 

separate memory within the FPGA. The separate memory is ,„ „ • # . . . , t . - . . , 

. . ■ ii i * . r *i. c *• « In one vanation of the mvention, the first and second 

connected in parallel to each of the configuration memory 25 „ « u . ~ A . f 

rr, K. > . ro ^ A 7 c - y 7 configuration cache memory arrays are used as first-level 

ceUs. To reconfigure the FPGA, an entue se t of configuration ^ ^ cond . levcl nrigwLn cache memories, respec- 

dau values «ored m the separate memory is sunulUneously dvel In ^ vari ation, both the first and second configu- 

transferred from the separate memory to the configuration ^ * . . , r ^ ? 

ii a » « / . _ , B . ration cache memory arrays store values for reconfiguring 

memory cells. As a result, the reconfiguration of the entire t . « . _ cnit L . . , - & c ^ 

pdpa. ii • * * iw it. m the first array of CLBs. If the values required for reconfigu- 

FPGA is virtually instantaneous. However, a relatively large 30 „ f #K / „ mn , # , . 4 . 6 . 

. r / , . . , . * j * « i ration of the first array of CLBs are located in the second 

amount of overhead is incurred in providing parallel con- « , . . , 

- - 4 . r ^ y , . " configuration cache memory array, then these values are 

nections between the separate memory and each of the ♦ j p t . i / , 

- „_ T .... 7 . „ , l transferred from the second configuration cache memory 

configuration memory cells. In addition, because all of the am „ in tK « w# . . J 

fi „ .. „ . , , . . , array to the nrst configuration cache memory array, and then 

configuration memory cells are loaded simultaneously, par- ,„ ~ t rrriiTv a cAt\> i_ 

t -, „ mBfini , M4 ,. If t , ™, A . . - ,yV , ,c to the first array of CLBs. The second array of CLBs has a 

tial reconfiguration of the FPGA cannot be perfbrmed. 35 c nA - - IT - \, , 

x « , 1T ™, A , ^ tixed configuration in this variation. Using the second con- 
Moreover, while conventional FPGAs can be configured to c *• i_ j . , 

f* a „ „ . . D A w . 6 T;, figuration cache memory array as a second-level cache can 

provide for small amounts of user RAM, the capacity of this ™™..k, *u ~a * . , • 

!.^„damv« it i" *« .i l. iL j » * advantageously elimmate the need to access a device exter- 

user RAM is typically limited by the data storage capacity , , Tr t^j. e , , c , . . 

fA , . ... . s~rf t> nal to the FPGA for additional configuraUon data values, 

of function generators present within the CLBs. _ . . . . _ ^ & 

■* i j *u r , , . , . ra-r , _ , . , 4Q In another vanation, the first configuration cache memory 

It would therefore be desirable to have an FPGA which 40 array is used to store reconfiguration values for the first array 

reduces the amount of routing purees required to recon- of CLBS, while the second configuration cache memory is 

ml t ^ £ deB ? We ^ usedasa random access memory (user RAM). This user 

FPGA which can be readdypartially reconfigured. It would ram advantageously provides a relatively large on-chip 
also be desirable if such an FPGA were also capable of being ram resource for me ppcA. 

reconfigured to provide a relatively large user RAM. » 4 * u . c , 

& In yet another variation, the first and second configuration 

SUMMARY OF THE INVENTION cache memory arrays can both be used to implement user 

RAM. An expansion bus system is provided to allow the 

Accordingly, the present invention provides a field pro- user to select the size of the user RAM. 

grammable gate array (FPGA) comprising a first array of 50 The present invention will be more fully understood in 

CLBs and a second array of CLBs. The first array of CLBs view of the following detailed description and drawings, 
is coupled to a corresponding first configuration cache 

memory array. The first configuration cache memory array BRIEF DESCRIPTION OF THE DRAWINGS 

stores values for reconfiguring the first array of CLBs. mr> i ut u a- c i i-ti^a 

»■ •« * , & f *L _ . . J FIG. 1 is a block diagram of a conventional FPGA; 

Similarly, the second array of CLBs is coupled to a cone- 55 ^ . . .... c , 

sponding second configuration cache memory array. The Ra 2 * a ^ m ^ c dia & ram of a conventional CLB; 
second configuration cache memory array stores values for mG 3 * a schematic diagram of a CLE circuit found in 

reconfiguring the second array of CLBs. The first and second ^ c ^® °^ ^» 

configuration cache memory arrays are independently FIG- 4 is a schematic diagram of a fast carry logic circuit 

controllable, such that the first array of CLBs can be 60 w hich can be connected to the CLE circuit of FIG. 3; 
reconfigured in response to values stored in the first con- FIG. 5 is a schematic diagram of a user RAM configu- 

figuration cache memory array, while the second array of ration circuit which enables the CLE circuit of FIG. 3 to be 

CLBs is not reconfigured. Thus, the FPGA is partially configured as a user-accessible random access memory; 
reconfigurable. FIG. 6 is a block diagram of an FPGA in accordance with 

In a particular embodiment, the first and second configu- 65 one embodiment of the present invention; 
ration cache memory arrays each store values for reconfig- FIG. 7 is a schematic representation of the CLB array and 

uring the first and second arrays of CLBs in a plurality of configuration cache memory array of the FPGA of FIG. 6; 
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FIG. 8 is a block diagram which illustrates a portion of the 
FPGA of FIG. 6 in more detail; 

FIG. 9 is a block diagram illustrating a CLB and local 
cache memory block of the FPGA of FIG. 6 in more detail; 

FIG. 10 is a block diagram illustrating a CLE circuit and 
a local cache memory configured as user RAM; 

FIG. 11 is a block diagram of a plurality of CLBs and 
local configuration cache memory blocks configured as user 
RAM; 

FIG. 12 is a block diagram of an FPGA which uses one 
configuration cache memory half array to store configuration 
data and another configuration cache memory half array as 
user RAM; 

FIG. 13 a block diagram of an FPGA which uses one 
configuration cache memory half array as a first level 
configuration cache memory and another configuration 
cache memory half array as a second level configuration 
cache memory; and 

FIG. 14 is a block diagram of a transfer circuit located 
between a local cache memory and a CLB in accordance 
with an alternative embodiment of the invention. 

DETAILED DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram of a field programmable gate 
array (FPGA) 1 which includes a centrally located array 2 of 
configurable logic blocks (CLBs), such as CLB 100, and a 
plurality of programmable input/output blocks (lOBs) 3 
located around the perimeter of the . FPGA 1. Programmable 
interconnect resources are located within the CLB array 2, 
and extend between the CLB array 2 and the IOBs 3. In 
general, the CLB array 2 provides the functional elements 
for constructing a logic circuit. The IOBs 3 provide an 

interface between the external pins of the FPGA 1 and the 35 Similarly, multiplexer 109 routes the DIN, P, G f or H' signal 

logic circuit implemented by the CLB array 2. Hie program- to flip-flop 105. Multiplexers 111 and 112 route the K signal 

mable interconnect resources provide routing paths to con- or the inverse of the K signal to flip-flops 104 and 105, 

nect the CLBs 2 and IOBs 3 onto the desired networks. respectively. Multiplexers 113 and 114 route the EC signal 

Customized configuration of the FPGA 1 is achieved by or a logic ' V to flip flops 104 and 105, respectively, 

programming internal static configuration memory cells that ^ Multiplexers 107-114 are configured by programming asso- 

determine the logic functions and interconnections of the ciated configuration bit latches within CLE circuit 10. Each 

CLBs 2, IOBs 3 and interconnect resources. of multiplexers 108 and 110-114 is configured in response 

FIG. 2 is a schematic diagram of a CLB 100 which is used to a single configuration memory cell, whereas each of 

in the XC4000EX™ family of FPGAs available from multiplexers 107 and 109 is programmed in response to two 

Xilinx, Inc., 2100 Logic Drive, San Jose, Calif. 95124. CLB 45 configuration memory cells. 

100 includes a configurable logic element (CLE) 10 and Flip flops 104 and 105 are edge-triggered D-type flip- 
surrounding programmable interconnect resources 11. CLB flops which operate in response to a clock signal, a clock 
100 is positioned next to adjacent CLBs such that the enable (EC) signal, a data signal D, a set (SD) signal and a 
programmable interconnect resources 11 of CLB 100 are reset (RD) signal. Each flip flop can be triggered on either 
connected to the programmable interconnect resources of 50 the rising or falling clock edge as determined by inverters 
the adjacent CLBs. The programmable interconnect 117-118 and multiplexers 111-112. The set and reset signals 
resources 11 include a plurality of interconnect lines and a are provided by S/R control circuits 115-116 in response to 
plurality of programmable interconnection points (PIPs). the internal S/R control signal. The flip flops are either 
The PIPs, which are typically implemented using pass enabled by the internal EC control signal, or continuously 
transistors and/or multiplexers, are illustrated as small 55 enabled, as determined by the settings of multiplexers 
empty boxes. These pass transistors are programmed to be 113-114. Two configuration memory cells are required to 
either turned on (conducting) or turned off (non-conducting) store the data stored in flip flops 104 and 105. 
in response to data values stored in corresponding configu- Thus, a total of sixty configuration memory cells must be 
ration memory cells. In one variation, the PIPs can be programmed to configure CLE circuit 10. Of these sixty 
encoded so the required number of configuration memory 50 configuration memory cells, thirty four memory cells store 
cells is less than or equal to the total number of PIPs. In the 'state data'. State data is defined as the data stored in the 
XC4000EX family of FPGAs, there are approximately 489 configuration memory cells associated with flip flops 104 
corresponding configuration memory cells which must be and 105 and function generators 101 and 102 (as the 
programmed to configure the programmable interconnect function generators store state information when configured 
resources 11. 65 as user RALM as described in reference to FIG. 5). 

FIG. 3 is a schematic diagram of CLE circuit 10. CUE CLE circuit 10 can additionally include circuitry which 

circuit 10 includes function generators 101-103, flip-flops can be configured to implement fast carry logic (FIG. 4). 



104-105, programmable switch 106, multiplexers 107-114, 
set/reset (S/R) control circuits 115-116 and inverters 
117-118. CLE circuit 10 includes thirteen input terminals 
(which receive the input signals labeled F1-F4, G1-G4, 
C1-C4 and K) and four output terminals (which provide the 
output signals labeled X, Y, XQ and YQ). These input and 
output terminals provide access to the function generators 
101-103 and flip flops 104-105. These input and output 
terminals are connected to programmable interconnect 
resources 11 as illustrated in FIG. 2. 

Within CLE circuit 10, programmable switch 106 maps 
four control input signals C1-C4 into four internal control 
signals HI, DIN, S/R and EC in any arbitrary manner. Eight 
configuration memory cells must be programmed to control 
the routing performed by programmable switch 106. In other 
embodiments, more than eight configuration memory cells 
are programmed to control programmable switch 106. Func- 
tion generator 101 receives the four input signals G1-G4 
and provides an output signal G\ Similarly, function gen- 
erator 102 receives the four input signals F1-F4 and pro- 
vides an output signal P. Each of function generators 
101-102 is capable of implementing any arbitrarily defined 
Boolean function of their four input signals. To accomplish 
this, sixteen configuration memory cells must be pro- 
grammed within each of function generators 101-102. Func- 
tion generator 103 receives input signals F, G f and HI, and 
in response, provides an output signal H' which can be any 
Boolean function of these three input signals. To accomplish 
this, eight configuration memory cells must be programmed 
within function generator 103. 

The P or H' signal is routed through multiplexer 110 to the 
X output terminal. Similarly, the GI or Hi signal is routed 
through multiplexer 108 to the Y output terminal. Multi- 
plexer 107 routes the DIN, P, G' or H* signal to flip-flop 104. 
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FIG. 4 is a schematic diagram of a fast carry logic circuit 140 
which is connected to function generators 101-102. Fast 
carry logic circuit 140 includes AND gates 401-402, exclu- 
sive OR gates 403-406, multiplexers 407-415 and configu- 
ration memory cells 416-426. Twelve configuration 
memory cells must be programmed to configure fast carry 
logic circuit 140 (configuration memory cell 419 stores 
2-bits). The operation of fast carry logic circuit 140 is 
described in more detail in 'The Programmable Logic Data 
Book', (1996) pp. 4-21 to 4-23, which is available from 
Xilinx, Inc. 

FIG. 5 is a schematic diagram of a user RAM configu- 
ration circuit 150 which enables the function generators 101 
and 102 of CLE circuit 10 to be configured as a user- 
accessible random access memory (user RAM), such that 
data can be written to and read from the function generators 
101 and 102 during normal logical operation of the FPGA1. 
User RAM configuration circuit 150 includes AND gates 
121-122, OR gates 123-124, multiplexer 125 and configu- 
ration memory cells 131-133. In general, the F1-F4 and 
G1-G4 input terminals of function generators 101 and 102 
act as address lines to select particular memory cells in the 
function generators. The functionality of the control input 
signals C1-C4 are modified such that the HI, DIN and S/R 
lines become the write enable (WE) and data (Dl, DO) lines 
for the user RAM. The user RM control circuit 1550 is 
described in more detail in 'The Programmable Logic Data 
Book', (1994) pp. 2-20 to 2-21, available from Xilinx, Inc. 
(Note that the Xilinx Data Books refer to CLE circuit 10 as 
a CLB.) The user RAM control circuit 150 is configured by 
programming three configuration memory cells. 

To summarize, the configuration of CLB 100 is defined by 
the data stored in approximately 564 configuration memory 
cells. Of these 564 configuration memory cells, 34 store 
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CLB array 601 is given a width of 48 CLBs to provide a 
square die for FPGA 600. Note that each CLB and its 
associated local configuration cache memory block has a 2:1 
aspect ratio. 

5 CLB array 601 is partitioned into a left half-array 601L 
which is made up of 24 rows and 24 columns of CLBs, and 
a right half-array 601R which is also made up of 24 rows and 
24 columns of CLBs. As described in more detail below, 
each of the CLB half-arrays 601L and 601R is independently 

10 re configurable. 

Similarly, configuration cache memory array 602 is made 
up of 24 rows and 48 columns of local configuration cache 
memory blocks. Configuration cache memory array 602 is 
partitioned into a left half-array 602L which is made up of 
24 rows and 24 columns of local configuration cache 
memory blocks, and a right half -array 602 R which is also 
made up of 24 rows and 24 columns of local configuration 
cache memory blocks. The 24 rows of CLB array 601 are 
interleaved with the 24 rows of configuration cache memory 
array 602 as illustrated in FIG. 6. 

Each of the CLBs in CLB array 601 has a corresponding 
local configuration cache memory block in configuration 
cache memory array 602. As described in more detail below, 
each local configuration cache memory block has the capac- 
ity to store the configuration data values and the state data 
values for four separate configurations of the corresponding 
CLB. 

FIG. 7 is a schematic representation of CLB array 601 and 
configuration cache memory array 602. In this 
representation, the configuration cache memory array 602 is 
located ' beneath * the CLB array 601. Configuration data and 
state data are transferred from the configuration cache 
memory half-arrays 602L and 602R into CLB half-arrays 



20 



25 



30 



state data and 530 store configuration data. In the past, these 35 601L and 601R, respectively. The CLBs in CLB array 601 

£± a. * . . _ 11 f t * • • -g . • • . « 



are interconnected to one another by the programmable 
interconnect resources of the CLBs (See, e.g., program- 
mable interconnect resources 11 of FIG. 2). As described in 
more detail below, the configuration cache memory array 
602 is typically logically invisible to the CLB array 601 
during the normal logical operation of the CLB array 601 
(unless the configuration cache memory array 602 is being 
used as user memory, as described in more detail below). 
FIG. 8 is a block diagram which illustrates a portion of 



configuration memory cells have been programmed in sev- 
eral different manners. Typically, the configuration memory 
cells are sequentially programmed in response to an exter- 
nally provided configuration bit stream. Newer technology, 
such as that described in U.S. PaL No. 5,600,263, teaches 40 
that the configuration data and state data for a particular 
configuration can be stored in a separate memory of the 
FPGA 1. As previously described, the separate memory is 

connected in parallel to each of the configuration memory 

celIs » such that an entire set of the configuration data values 45 FPGA 600 in more detail FIG. 8 illustrates configuration 

and state data values can be simultaneously loaded from the port 603, cache configuration bus 610 (which is part of 

separate memory to the configuration memory cells. memory interface circuit 605 (FIG. 6)), CLBs 701-712, 

In contrast, and as described in more detail below, the corresponding local configuration cache memory blocks 

present invention provides a configuration cache memory 721-732, and the local bus structure which exists between 

block within each CLB which loads the configuration 50 each CLB and its corresponding local configuration cache 

memory cells over a plurality of clock cycles. By doing so, memory block. CLBs 701-706 are part of CLB half -array 

the routing overhead associated with the configuration cache 601 L, and CLBs 707-712 are part of CLB half-array 601R. 

memory is greatly reduced. Similarly, local configuration cache memory blocks 

FIG. 6 is a block diagram of an FPGA 600 in accordance 721-726 are part of configuration cache memory half -array 

with one embodiment of the present invention. FPGA 600 55 602L, and local configuration cache memory blocks 

includes a CLB array 601, a configuration cache memory 727-732 are part of configuration cache memory half-array 

array 602, a configuration port 603, a region of lOBs 604, 602R. 

and a memory interface circuit 605. In the described Cache configuration bus 610 extends between the left 

embodiment, CLB array 601 is made up of 24 rows and 48 half-arrays 601L, 602L and the right half-arrays 601R, 

columns of XC4000EX CLBs. Thus, each of the CLBs in 60 602R. Each of the local configuration cache memory blocks 

CLB array 601 includes the circuitry previously described in 721-732 is connected in parallel to cache configuration bus 

connection with FIGS. 2-5. Other types of CLBs can also be 610. The cache configuration bus 610 is further coupled to 

used, and are considered within the scope of the present configuration port 603, thereby providing an interface 

invention. CLB array 601 is given a height of 24 CLBs to between the configuration port 603 and the configuration 

facilitate 32-bit arithmetic (16 CLBs high), with additional 65 cache memory array 602. This interface is a random access 

space above and below for control logic being implemented (read and write) interface between configuration port 603 

with CLBs located near the top and bottom of this height. and configuration cache memory array 602. As a result, 
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configuration cache memory array 602 can be accessed 
directly by an external processor through configuration port 
603. In the described embodiment, the cache configuration 
bus 610 includes a dedicated data bus which is eight bits 
wide. Each of the local configuration cache memory blocks 5 
721-732 is assigned a unique access address to enable these 
local configuration cache memory blocks to be individually 
addressed by signals transmitted on the cache configuration 
bus 610. Thus, configuration bus 610 includes a dedicated 
address bus having a width which is sufficient to carry these 10 
cache memory block addresses. 

A 48-bit wide bus extends between each of CLBs 
701-712 and its corresponding local configuration cache 
memory block. For example, 48-bit wide bus 750 extends 
between CLB 701 and local configuration cache memory 15 
block 721. As described in more detail below, these 48-bit 
wide buses are used to transfer configuration data values and 
state data values from local configuration cache memory 
blocks 721-732 to the corresponding CLBs 701-712. An 
8-bit address bus also extends from each of CLBs 701-712 
to its corresponding local configuration cache memory 
block. For example, 8-bit address bus 751 extends from 
CLB 701 to local configuration cache memory block 721. As 
described in more detail below, these 8-bit wide address 
buses are used to transfer memory addresses from CLBs 25 
701-712 to local configuration cache memory blocks 
721-732. A pair of 2-bit data buses also extends between 
each of CLBs 701-712 and its corresponding local configu- 
ration cache memory block. For example, 2-bit data bus 752 
extends from CLB 701 to local configuration cache memory 30 
block 721, and 2-bit data bus 753 extends from local 
configuration cache memory block 721 to CLB 701. As 
described in more detail below, these 2-bit wide data buses 
are used to transfer data between CLBs 701-712 and local 
configuration cache memory blocks 721-732. Finally, 35 
expansion buses, such as expansion buses 754 and 755, 
extend between the local configuration cache memory 
blocks 721-732 as illustrated. As described in more detail 
below, the 4-bit wide expansion buses 754 and the 10-bit 
wide (8-bit+2-bit) expansion buses 755 are used to join local 40 
configuration cache memory blocks to form a larger user 
memory. 

FIG. 9 is a block diagram illustrating CLB 701 and local 
cache memory block 721 in more detail. CLB 701 and local 
cache memory block 721 are representative of the other 45 
CLBs and local cache memory blocks of FPGA 600. CLB 
701 includes CLE circuit 901 and programmable intercon- 
nect resources 902. CLB 701 is substantially identical to 
CLB 100 (FIGS. 2-5). Thus, CLE circuit 901 and program- 
mable interconnect resources 902 are substantially identical so 
to the previously described CLE circuit 10 and program- 
mable interconnect resources 11. As a result, CLB 701 
includes approximately 564 configuration memory cells 
which must be programmed in order to configure CLB 701. 

Local configuration cache memory block 721 is parti- 55 
u'oned into four local cache memories 801-804. Each of the 
local cache memories 801-804 has a width of 48 columns 
and a depth of 13 rows. In other embodiments, the local 
cache memories 801-804 can have other dimensions. Each 
of the local cache memories 801-804 is further partitioned 60 
into a configuration data cache memory and a state data 
cache memory. Thus, local cache memory 801 is partitioned 
into configuration data cache memory 811 and state data 
cache memory 821. Local cache memory 802 is partitioned 
into configuration data cache memory 812 and state data 65 
cache memory 822. Local cache memory 803 is partitioned 
into configuration data cache memory 813 and state data 



cache memory 823. Finally, local cache memory 804 is 
partitioned into configuration data cache memory 814 and 
state data cache memory 824. 

Each of the configuration data cache memories 811-814 
is 48 columns wide by 12 rows deep. Each of the configu- 
ration data cache memories 811-814 stores the 530 configu- 
ration data values required to configure CLB 701. These 530 
configuration data values are stored as 12 words, each 
having a width of 48 bits. Not all of the 12x48 bits are used 
to store configuration data values. Unused bits can be used 
for other purposes, such as storing save and restore 
addresses, as described in more detail below. 

Each of the state data cache memories 821-824 is 48 
columns wide by 1 row deep. Each of the state data cache 
memories 821-824 stores the 34 state data values required 
to restore the status of the function generators and flip flops 
of CLB 701 (fourteen of the 48 columns of each state data 
cache memory are not used to store state data values). 

The entries of the local cache memories 801-804 which 
are not used to store configuration data values or state data 
values can be used to store a save address and a restore 
address. The save address indicates the local cache memory 
to which the state data values are to be saved when the 
configuration of CLB 701 is modified, and the restore 
address indicates the local cache memory from which the 
state data values are to be restored when the configuration of 
CLB 701 is modified. 

For example, a 2-bit save address can be stored at a first 
predetermined location within local cache memory 801, and 
a 2-bit restore address can be stored at a second predeter- 
mined location within local cache memory 801. The 2-bit 
save address identifies the local cache memory to which the 
current state data values will be saved when CLB 701 is 
reconfigured from the configuration defined by local cache 
memory 801 to another configuration. The 2-bit save address 
can identify any one of the four local cache memories 
801-804. 

In one example, the 2-bit save addresses of *00\ '01% '10* 
and ' 11' identify local cache memories 801, 802, 803 and 
804, respectively. In such an example, suppose that the 2-bit 
save address stored in local cache memory 801 has a value 
of '10', thereby identifying local cache memory 803. Further 
suppose that the configuration data values, including the 
save address bits, associated with local cache memory 801 
are currently used to configure CLB 701. When the CLB 701 
is to be reconfigured with configuration data values from 
another one of local cache memories 802-804, the current 
state data values stored by CLB 701 will be stored to state 
data cache memory 823 of local cache memory 803 (rather 
than state data cache memory 821 of local cache memory 
801) in response to the 2-bit save address having a '10* 
value. 

In a similar manner, the 2-bit restore address identifies the 
local cache memory from which the current state data values 
will be retrieved when CLB 701 is configured in response to 
configuration data values stored in a particular local cache 
memory. The 2-bit restore address can identify any one of 
the four local cache memories 801-804. 

In one example, the 2-bit restore addresses of * 00', 1 01% 
'10' and '11' identify local cache memories 801, 802, 803 
and 804, respectively. In such an example, suppose that the 
2-bit restore address stored in local cache memory 801 has 
a value of 1 11*, thereby identifying local cache memory 804. 
Further suppose that the configuration data values, including 
the restore address, associated with local cache memory 801 
are currently being loaded to configure CLB 701. When 
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restoring the state data values to CLB 701, the state data 
values stored in state data cache memory 824 of local cache 
memory 804 will be loaded to configure CLB 701 in 
response to the 2-bit restore address having a '11' value 
(rather than the state data values stored in state data cache 
memory 821 of local cache memory 801). 

Enabling the state data values to be saved and restored 
from different local cache memories as described above, 
advantageously expands the configuration and reconfigura- 
tion possibilities of CLB 701. 

In an alternative embodiment, one or more of the local 
cache memories 801-804 are used to store more than one set 
of state data values. For example, the thirteen rows of local 
cache memory 804 can be used to store thirteen different sets 
of state data values. That is, the configuration data cache 
memory 814 of local cache memory 804 is used to store state 
data values instead of configuration data values. This pro- 
vides a larger capacity for storing state data values. In the 
previously described example, there would be sixteen acces- 
sible sets of state data values (i.e., 1 set of state data values 
in each of local cache memories 801-803 and 13 sets of state 
data values in local cache memory 804). Each of these 
sixteen sets of state data values could be accessed by each 
of local cache memories 801-803 by implementing a 4-bit 
save address and a 4-bit restore address within each of local 
cache memories 801-803 in the manner previously 
described. All of the addresses of the local cache memories 
801-804 are available to store state data values (up to 52 sets 
of state data values). 

In accordance with one embodiment of the invention, 
CLB 701 is reconfigured from local cache memory 801 as 
described in detail below. It is understood that CLB 701 can 
also be reconfigured in a similar manner from local cache 
memories 802-804. That is, any one of the four local cache 
memories 801-804 can be selected to reconfigure CLB 701. 
It is further understood that all of the CLBs in CLB 
half-array 601L (FIG. 6) are reconfigured at the same time 
as (i.e., in parallel with) CLB 701. All of the CLBs in CLB 
half-array 601R (FIG. 6) can also be configured at the same 
time, although this is not necessary. The reconfiguration of 
CLB half-array 601L and the reconfiguration of CLB half- 
array 601R are independently controllable events. 

CLB 701 is reconfigured using a 4-step, 15-cycle process. 
In the first step, which takes one cycle, the 34 bits of state 
data currently stored in the function generators and flip flops 
of CLB 701 are written to state data cache memory 821 (i.e., 
the 13th row of local cache memory 801). (This example 
assumes that the save address of the current configuration 
identifies local cache memory 801.) In the second step, 
which also takes one cycle, all of the configuration memory 
cells of CLB 701 are cleared to avoid contention during the 
subsequent reconfiguration steps. In the third step, which 
takes twelve cycles, the configuration data values stored in 
configuration data cache memory 811 are transferred to CLB 
701 over the 48-bit bus 750 (FIG. 8). More specifically, the 
twelve rows of configuration data cache memory 811 are 
sequentially loaded to the configuration memory cells of 
CLB 701 during 12 consecutive cycles. That is, one row of 
configuration data cache memory 811 is written to CLB 701 
per cycle. This transfer is completed, for example, by using 
conventional address sequencing circuits to access both data 
cache memory 811 and the configuration memory cells of 
CLB 701. In the fourth step, which takes one cycle, the state 
data values stored in state data cache memory 821 are 
transferred to the function generators and flip flops of CLB 
701 over the 48-bit bus 750 (FIG. 8). (This example assumes 
that the restore address of the loaded configuration identifies 
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state data cache memory 821 of local cache memory 801.) 
Assuming a transfer rate of 60 MHz, the reconfiguration 
time is approximately 250 nsec. 

The first step (saving the state data values) and the fourth 

5 step (restoring the state data values) are performed using 
configuration dependent addresses, thereby enabling the 
state data values to be shared among the four possible 
configurations as previously described. For example, in the 
previous example, the state data values could have been 

10 saved in any one of the state data cache memories 821-824 
in the first step, even though CLB 701 was receiving 
configuration data values from configuration data cache 
memory 811. Similarly, the state data values could have been 
restored from any one of the state data cache memories 

15 821-824 during the fourth step, even though CLB 701 was 
receiving configuration data values from configuration data 
cache memory 811. 

The configuration data values and state data values which 
are stored in configuration cache memory array 602 (FIGS. 

20 7 and 8) are initially loaded into the configuration cache 
memory array 602 from an external source, such as a control 
processor, through the configuration port 603 and the 
memory interface circuit 605. This facilitates a single cycle 
transfer of state data values as follows. First, the state data 

25 values are written from the configuration port 603 to the 
locations identified by the current restore addresses within 
the local cache memories. These state data values are then 
immediately restored (i.e., written to the associated CLBs) 
without reconfiguring the associated CLBs. 

30 Moreover, because the configuration cache memory array 
602 remains coupled to the configuration port 603 and the 
memory interface circuit 605 during normal operation of 
FPGA 600, an external device can access the configuration 
and state data values stored in configuration cache memory 

35 array 602 during normal operation of FPGA 600. Note that 
in order to read the current information from the memory 
cache, the values in the cache must be updated using a save 
operation (that can occur without reconfiguring the associ- 
ated CLE). 

40 256x2 RAM 

In an alternate embodiment of the present invention, any 
one or more of the local cache memories 801-804 can be 
modified to operate as a 256x2 bit random access memory. 
FIG. 10 is a block diagram illustrating the interconnections 

45 provided between CLE circuit 901 and local cache memory 
801 to achieve a 256x2 RAM. Input terminals F1-F4 and 
G1-G4 of the CLB 701 are connected to address bus 751 as 
illustrated. Address bus 751 is selectively coupled to local 
cache memory 801 by programming a plurality of program- 

50 mable interconnect points (not shown). These program- 
mable interconnection points are programmed by corre- 
sponding configuration memory cells. These memory cells, 
in turn, are loaded with values stored in the configuration 
cache memory. In this manner, the input signals F1-F4 and 

55 G1-G4 are provided to local cache memory 801 as an 8-bit 
address signal. This address signal is used by conventional 
address decoder circuitry within local cache memory 801 to 
address one of 256 addresses within local cache memory 
801. Each of the 256 addresses is associated with a 2-bit data 

60 value. 

To write a data value to local cache memory 801, a 2-bit 
data value is routed from the lines providing the CI and C2 
input signals to local cache memory 801 on bus 752, and a 
write enable signal is routed from the line providing the C3 
65 input signal to local cache memory 801 as illustrated. The 
2-bit data value provided on bus 752 is written to local cache 
memory 801 at the address defined by the input signals 
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F1-F4 and G1-G4. In this manner, 2-bit data values can be 
written to each of the 256 addresses in local cache memory 
801. Note that in another embodiment, using programmable 
switch 106, internal signals HI, DIN, and S/R become 
signals WE, Dl, and DO, respectively (see FIG. 5). 

To read a data value from local cache memory 801, the 
write enable signal on input terminal C3 is de-asserted. In 
response, local cache memory 801 provides the 2-bit data 
value corresponding with the current address signal to 
multiplexers 1001 and 1002 on bus 753 as illustrated. 
Multiplexers 1001 and 1002 are programmed (by configu- 
ration memory cells) to pass the 2-bit data value from local 
cache memory 801. The 2-bit output data value can be read 
out of CLE circuit 901 through the X and Y output terminals 
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response to these signals, local cache memories determine 
whether they are to be accessed. For example, if the WE 
signal is asserted, then local cache memories 1111-1112, 
1114-1115 and 1117-1118 are informed that the current 
operation is a write operation. If the CS[0] signal is asserted, 
and CS[1] and CS[2] signals are not asserted, then local 
cache memories 1111 and 1112 are informed that they are 
being selected for the write operation. If the PS signal has a 
value of '00% then local cache memories 1111 and 1112 are 
informed that a particular one (e.g., the first) of the four 
planes of 256x2 memory is selected for the write operation. 

The address and data values are provided to the 3Kx4 
memory through CLBs 1101 and 1102. CLB 1101 and local 
cache memory 1111 are configured in the same manner as 



or through flip flops 104 and 105 in the same manner as the 15 previously described for CLB 901 and local cache memory 
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normal output signals F and G' of function generators 101 
and 102. Note that when local cache memory 801 is not 
operating as a 256x2 RAM, multiplexers 1001 and 1002 are 
programmed to pass the output signals of function genera- 
tors 101 and 102, respectively. 

Each CLB half-array 601L, 601R (FIGS. 7 and 8) has a 
capacity of 36 Kbytes per configuration (256x2 bits per local 
cache memoryxl byte/8 bitsx24x24 local cache memories 
per CLB half -array), with a maximum of 288 Kbytes avail - 



801 in FIG. 10. CLB 1102 and local cache memory 1112 are 
configured in the same manner as CLB 1101 and local cache 
memory 1111. Thus, CLB 1101 is coupled to receive address 
signals A7: AO and data signals D3:D2. Similarly, CLB 1102 
20 is coupled to receive address signals A7: AO and data signals 
DL:D0. CLBs 1101 and 1102 provide data output signals 
B3:B2 and B1:B0, respectively. 

The address signals A7:A0 and data signals D3:D2 
received by CLB 1101 are routed to local cache memory 



able for all four configurations of both half-arrays (36 25 1111 in the manner previously described in connection with 
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FIG. 10. In addition, these address signals A7: AO and data 
signals D3:D2 are routed to local cache memories 1114 and 
1117 on expansion buses 755. In a similar manner, expan- 
sion buses 755 are used to route data values read from local 
cache memories 1114 and 1117 back to CLB U01. In this 
manner, expansion buses 755 allow all address signals and 
data signals for local cache memories 1111, 1114 and 1117 
to be routed through CLB 1101. As a result, CLBs 1104 and 
1107 are free to perform other functions. 

Similarly, the address signals A7:A0 and data signals 
D1:D0 received by CLB 1102 are routed to local cache 
memory 1112 in the manner previously described in con- 
nection with FIG. 10. In addition, these address signals 
A7:A0 and data signals D1:D0 are routed to local cache 



Kbytesx4 configurations per CLB half*arrayx2 CLB half- 
arrays per FPGA). 

The 256x2 RAM can be expanded by including one or 
more of the local cache memories 802-804, in addition to 
local cache memory 801. This RAM can further be 30 
expanded by including one or more of the local configura- 
tion cache memory blocks 722-732, in addition to local 
configuration cache memory block 721. 

FIG. 11 is a block diagram which illustrates a 3Kx4 RAM 
which is formed by such expansion. The RAM includes nine 35 
CLBs 1101-1109 and their corresponding local cache 
memories 1111-1119. CLBs 1103, 1106 and 1109 are used to 
control the operation of the RAM, while local cache memory 
blocks 1111-1112, 1114-1115 and 1117-1118 provide the 

memory storage cells of the RAM. The local cache memory 40 memories 1115 and 1118 on expansion buses 755. In a 
blocks 1111-1112, 1114-1115, and 1117-1118 are arranged similar manner, expansion buses 755 are used to route data 
in three columns as illustrated, with each column being values read from local cache memories 1115 and 1118 back 
controlled by a corresponding one of the CLBs 1103, 1106 to CLB 1102. In this manner, expansion buses 755 allow all 
and 1109. address signals and data signals for local cache memories 

Each of CLBs 1103, 1106 and 1109 receives a write 45 1112, 1115 and 1118 to be routed through CLB 1102. As a 
enable WE signal, a column select CS signal and a plane result, CLBs 1105 and 1108 are free to perform other 
select PS signal. Each of the three CLBs 1103, 1106 and functions. 

1109 receives a corresponding 1-bit column select signal It is important to note that expansion buses 754 and 755 

CS[0], CS[1] or CS[2]. At most, one of these column select are configurable, such that user RAMS of different depths 
signals is asserted at any given time, thereby selecting one 50 and widths can be constructed. For example, the connection 
of the three columns. The PS signal is a 2-bit signal which from address bus 751 to expansion bus 755 is configurable 
is provided to each of the three CLBs 1103, 1106 and 1109 such that address bus 751 may or may not connect to 
to select one of the four local cache memories within each expansion bus 755 within local configuration cache memory 
of the local cache memory blocks 1111-1112, 1114-1115 block 721. In addition, the continuation of the expansion bus 
and 1117-1118. At any given time, the PS signal enables one 55 755 along the horizontal direction is configurable, thereby 



of the four local cache memories within one of the three 
columns. The WE signal is a 1 bit write enable signal which 
is provided to each of the three CLBs 1103, 1106 and 1109 
to select a write (or read) operation within the local cache 
memory blocks 1111-1112, 1114-1115 and 1117-1118. 

The CS, PS and WE signals are routed through CLBs 
1103, 1106 and 1109 to local cache memories 1113, 1116 and 
1119, respectively using bus 751 (FIG. 10). Local cache 
memories 1113, 1116 and 1119, in turn, route these CS, PS 



determining the width of the resulting RAM array. Similarly, 
the connections from expansion bus 754 to the various CLBs 
and the local cache memories are configurable (as is the 
continuation of expansion bus 756 in the vertical direction), 
60 thereby allowing the height of the resulting RAM array to be 
programmably selected. 

In yet another embodiment of the present invention, the 
configuration cache memory half-arrays 602L and 602R can 
be independently controlled such that one of the configura- 
and WE signals to expansion buses 754. Expansion buses 65 tion cache memory half arrays is used as a configuration 
754 route these signals to the local cache memories cache memory which stores configuration data values and 
1U1-1U2, 1114-1115 and 1117-1118 as illustrated. In state data values for reconfiguring the corresponding CLB 
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half-array (see, FIGS. 7-9), and the other one of the con- 
figuration cache memory half arrays is used as an indepen- 
dent random access memory (see, FIGS. 10-11). FIG. 12 is 
a schematic representation of this embodiment, which illus- 
trates configuration cache memory half-array 602R being 
used as a configuration cache memory for CLB half-array 
601R, and configuration cache memory half-array 602 L 
being used as a RAM. The configuration of CLB half-array 
601L is fixed in this embodiment. This hybrid use advanta- 
geously provides operating flexibility to the FPGA 600. 
RAM half-array 602 L can also be accessed through memory 
interface 605 and configuration port 603. Memory interface 
605 (FIG. 6) would also be accessible to the CLB half array 
601L. 
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cycle, A[47:24] is loaded into register 1403. During the 
second cycle, another 72-bit word B[71:0} is provided on 
bus 1411. The first 24 bits B[71:48] are provided to multi- 
plexer 1401, the second 24 bits B[47:24] are provided to 
registers 1403 and 1404, and the third 24 bits B[23:0] are 
provided to multiplexer 1402. During the second cycle, 
multiplexers 1401 and 1402 are controlled to pass B[71:48] 
and B[23:0] to the 48-bit bus 1412. Also during the second 
cycle, B[47:24] is loaded into register 1404. During the third 
cycle, multiplexer 1401 is controlled to pass A[47:24] from 
register 1403 to bus 1412, and multiplexer 1402 is controlled 
to pass B[47:24] from register 1404 to bus 1412. The 
foregoing pattern is repeated until the CLB 701 is reconfig- 
ured. In an alternate embodiment, register 1404 is 



In yet another embodiment, one of the configuration 15 ?^f tcd ' ao u d B u [4 * 24] * apphed direCtly to multl P lexer 
cache memory half-arrays 602L and 602R is used as first 1^ durii^ tte third cycle. 

level configuration cache memory, and the other one of the .^though the invention has been described m connection 
configuration cache memory half-arrays 602L and 602R is T scverd embodiments, it is understood that tins inven- 
used as a second level configuration cache. FIG. 13 is a Uon * °f to * e e °*«^nts disclosed, but is 

schematic representation of this embodiment, which illus- 20 CapabIc of ™™ m °<^ 10as which would be apparent to 



trates configuration cache memory half -array 602R being 
used as a first level configuration cache memory for CLB 
half array 601R, and configuration cache memory half-array 
602L being used as a second level c array 601 R. The he 
memory for CLB half array 601 R. The configuration of CLB 25 
half-array 601L is fixed in this embodiment. If the configu- 
ration data required by CLB half-array 601 R resides in the 
second level configuration cache memory (i.e., configuration 
cache memory half-array 602 L), then this configuration data 
is transferred from the second level configuration cache 30 
memory to the first level configuration cache memory (i.e., 
configuration cache memory half array 602 R) through the 
memory interface circuit 605. In making this transfer, one of 
the sets of configuration data previously stored in the first 
level configuration cache memory is overwritten with the set 35 
of configuration data transferred from the second level 
configuration cache memory. The transfer of a set of con- 
figuration data from the second level configuration cache 
memory to the first level configuration cache memory can be 
performed in approximately 50 microseconds by using all of 40 
the buses between the CLB rows in parallel. The transferred 
set of configuration data is then transferred from the first 
level configuration cache memory to CLB half-array 601R. 
In this manner, the two configuration cache memory half- 
arrays 602 L and 602R can store up to eight sets of configu- 45 
ration data for one of the CLB half-arrays 601 L or 601R. 

The previously described FPGA is useful in various 
applications, including those which implement dynamic 
configuration, virtual logic, reconfigurable co-processing, 
and reconfigurable computing 

Although the local cache memories (e.g., local cache 
memory 801 (FIG. 9)) have been described as having 13 
rows and 48 columns, it is understood that in other 
embodiments, other dimensions can be used. For example, 
the local cache memories could have 72 columns and 9 
rows. FIG. 14 illustrates a transfer circuit 1400 which is 
located between the local cache memory 801 and the CLB 
701 in such an embodiment. Transfer circuit 1400 includes 
multiplexers 1401 and 1402 and registers 1403 and 1404. 



a person skilled in the art. For example, other embodiments 
of the present invention may include different state storage 
structures, such as RAM. Thus, the invention is limited only 
by the following claims. 
We claim: 

1. A field programmable gate array (FPGA) comprising: 
a first array of configurable logic blocks having a corre- 
sponding first set of configuration memory cells; 

a first configuration cache memory array coupled to the 
first array of configurable logic blocks, wherein the first 
configuration cache memory array stores values to be 
loaded into the first set of configuration memory cells, 
thereby reconfiguring the first array of configurable 
logic blocks; 

a second array of configurable logic blocks having a 
corresponding second set of configuration memory 
cells; 

a second configuration cache memory array coupled to the 
second array of configurable logic blocks, wherein the 
second configuration cache memory array stores values 
to be loaded into the second set of configuration 
memory cells, thereby reconfiguring the second array 
of configurable logic blocks; and 
a control circuit for transferring values between the first 
configuration cache memory array and the first set of 
configuration memory cells, and for independently 
transferring values between the second configuration 
cache memory array and the second set of configuration 
memory cells. 

2. The FPGA of claim 1, wherein the first configuration 
cache memory array stores values for reconfiguring the first 
array of configurable logic blocks in a plurality of configu- 
rations. 

3. The FPGA of claim 2, wherein the second configuration 
55 cache memory array stores values for reconfiguring the 

second array of configurable logic blocks in a plurality of 
configurations. 

4. The FPGA of claim 1, wherein the values stored in the 
first configuration cache memory array comprise a first set of 
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During a first clock cycle, a 72-bit word A[71:0] is provided 60 configuration data values for programming corresponding 



on bus 1411. The first 24 bits A[71:48] of this 72-bit word 
are provided to multiplexer 1401, the second 24 bits 
A[47:24] of this 72-bit word are provided to registers 1403 
and 1404, and the third 24 bits A[23:0] of this 72-bit word 



are provided to multiplexer 1402. During the first cycle, 65 perform a first function. 



configuration memory cells in the first array of configurable 
logic blocks in a first configuration, and a first set of state 
data values for programming corresponding state storage 
structures in the first array of configurable logic blocks to 



multiplexers 1401 and 1402 are controlled to pass A[71:48] 
and A[23:0] to the 48-bit bus 1412. Also during the first 



5. The FPGA of claim 4, wherein the values stored in the 
first configuration cache memory array further comprise a 
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second set of configuration data values for programming 
corresponding configuration memory cells in the first array 
of configurable logic blocks in a second configuration, and 
a second set of state data values for programming corre- 
sponding state storage structures in the first array of con- 
figurable logic blocks to perform a second function. 

6. The FPGA of claim 5, further comprising means for 
loading the first set of configuration data values and the first 
set of state data values from the first configuration cache 
memory array to the first set of configuration memory cells, 
thereby configuring the first array of configurable logic 
blocks in a first configuration. 

7. The FPGA of claim 6, further comprising means for 
loading the second set of configuration data values and the 
second set of state data values from the first configuration 
cache memory array to the first set of configuration memory 
cells, thereby configuring the first array of configurable logic 
blocks in a second configuration. 

8. The FPGA of claim 6, further comprising means for 
loading the first set of configuration data values and the 
second set of state data values from the first configuration 
cache memory array to the first set of configuration memory 
cells, thereby configuring the first array of configurable logic 
blocks in a second configuration. 

9. The FPGA of claim 1, wherein each of the configurable 
logic blocks in the first array of configurable logic blocks has 
a corresponding local configuration memory block in the 
first configuration cache memory array. 

10. The FPGA of claim 9, wherein each local configura- 
tion memory block stores values associated with a plurality 
of configurations of a corresponding configurable logic 
block. 

11. The FPGA of claim 1, further comprising a configu- 
ration port for accessing the first and second configuration 
cache memory arrays. 

12. The FPGA of claim 1, wherein the control circuit is 
configured to load values from the first configuration cache 
memory array to the first set of configuration memory cells 
over a plurality of operating cycles. 

13. The FPGA of claim 12, wherein the control circuit is 
configured to load values from the second configuration 
cache memory array to the second set of configuration 
memory cells over a plurality of operating cycles. 

14. The FPGA of claim 1, wherein the first array of 
configurable logic blocks is arranged in a plurality of rows 
and the first configuration cache memory array is arranged 
in a plurality of rows, wherein the rows of the first array of 
configurable logic blocks are interleaved with the rows of 
the first configuration cache memory array. 

15. The FPGA of claim 14, wherein the second array of 
configurable logic blocks is arranged in a plurality of rows 
and the second configuration cache memory array is 
arranged in a plurality of rows, wherein the rows of the 
second array of configurable logic blocks are interleaved 
with the rows of the second configuration cache memory 
array. 

16. The FPGA of claim 15, wherein the first array of 
configurable logic blocks and the first configuration cache 
memory array are located on a first half of the FPGA, and 
the second array of configurable logic blocks and the second 
configuration cache memory array are located on a second 
half of the FPGA. 

17. The FPGA of claim 1, wherein the values stored in the 
first configuration cache memory array comprise a first set of 
state data values for programming corresponding flip-flops 
in the configurable logic block. 

18. The FPGA of claim 1, wherein the values stored in the 
first configuration cache memory array comprise a first set of 
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state data values for programming corresponding function 
generators in the first array of configurable logic blocks to 
perform a first function. 

19. A field programmable gate array (FPGA) comprising: 
a first array of configurable logic blocks having a corre- 
sponding first set of configuration memory cells; 

a first configuration cache memory array coupled to the 
first array of configurable logic blocks, wherein the first 
configuration cache memory array stores values to be 
loaded into the first set of configuration memory cells, 
thereby reconfiguring the first array of configurable 
logic blocks; 

a second array of configurable logic blocks having a 
corresponding second set of configuration memory 
cells; 

a second configuration cache memory array coupled to the 
first configuration cache memory array, wherein the 
second configuration cache memory array stores values 
to be loaded into the first set of configuration memory 
cells, thereby reconfiguring the first array of config- 
urable logic blocks; and 

control circuitry for transferring values from the second 
configuration cache memory array to the first configu- 
ration cache memory array. 

20. Afield programmable gate array (FPGA) comprising: 

a first array of configurable logic blocks having a corre- 
sponding first set of configuration memory cells; 

a first configuration cache memory array coupled to the 
first array of configurable logic blocks, wherein the first 
configuration cache memory array stores values to be 
loaded into the first set of configuration memory cells, 
thereby reconfiguring the first array of configurable 
logic blocks; 

a second array of configurable logic blocks having a 
corresponding second set of configuration memory 
cells; 

a second configuration cache memory array coupled to the 
second array of configurable logic blocks; and 

a control circuit for configuring the second configuration 
cache memory array to either store values to be loaded 
into the second set of configuration memory cells, 
thereby reconfiguring the second array of configurable 
logic blocks, or to operate as a random access memory. 

21. A method of operating a field programmable gate 
array (FPGA) comprising the steps of: 

providing a first array of configurable logic blocks and a 
second array of configurable logic blocks on the FPGA; 

providing a first configuration cache memory array 
coupled to the first array of configurable logic blocks 
and a second configuration cache memory array 
coupled to the second array of configurable logic 
blocks; 

reconfiguring the first array of configurable logic blocks 
in response to values stored in the first configuration 
cache memory array; and 

reconfiguring the second array of configurable logic 
blocks in response to values stored in the second 
configuration cache memory array, wherein the step of 
reconfiguring the first array of configurable logic 
blocks is performed independent of the step of recon- 
figuring the second array of configurable logic blocks, 
thereby enabling partial reconfiguration of the FPGA. 

22. The method of claim 21, wherein the step of recon- 
figuring the first array of configurable logic blocks further 
comprises the step of loading the values stored in the first 
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configuration memory array to the first array of configurable 
logic blocks over a plurality of operating cycles. 

23. The method of claim 22, wherein the values stored in 
the first configuration memory array comprise a first set of 
configuration data values for programming corresponding 5 
configuration memory cells in the first array of configurable 
logic blocks in a first configuration, and a first set of state 
data values for programming corresponding state storage 
structures in the first array of configurable logic blocks to 
perform a first function, the method further comprising the 10 
steps of: 

clearing the first array of configurable logic blocks; 
loading the first set of configuration data values into the 

first array of configurable logic blocks over a plurality 

of operating cycles; and 

loading the first set of state data values into the first array 
of configurable logic blocks. 

24. The method of claim 22, wherein the values stored in 
the first configuration memory array comprise a first set of 
configuration data values for programming corresponding 
configuration memory cells in the first array of configurable 
logic blocks in a first configuration, and wherein the first 
array of configurable logic blocks includes a plurality of 
state storage structures which store a first set of state data 
values, the method further comprising the steps of: 

storing the first set of state data values in the first 
configuration memory array; 

clearing the first array of configurable logic blocks; 

loading the first set of configuration data values from the 30 
first configuration memory array to the first array of 
configurable logic blocks over a plurality of operating 
cycles; and 

loading the first set of state data values from the first 
configuration memory array into the first array of 35 
configurable logic blocks. 

25. A method of operating a field programmable gale 
array (FPGA) comprising the steps of: 

providing a first array of configurable logic blocks and a 
second array of configurable logic blocks on the FPGA; 

providing a first configuration cache memory array 
coupled to the first array of configurable logic blocks 
and a second configuration cache memory array 
coupled to the first configuration cache memory array; 

reconfiguring the first array of configurable logic blocks 
in response to values stored in the first configuration 
cache memory array; 

transferring the values stored in the second configuration 
cache memory array to the first configuration cache 
memory array; and 

reconfiguring the first array of configurable logic blocks 
in response to values transferred from the second 
configuration cache memory array to the first configu- 
ration cache memory array. 

26. A field programmable gate array (FPGA) comprising: 
an array of configurable logic blocks; 

an array of random access memory (RAM) blocks for 
storing one or more sets of configuration data values for 
configuring the configurable logic blocks, wherein each ^ 
RAM block has a corresponding configurable logic 
block; 

a plurality of configuration buses, wherein each configu- 
ration bus couples a RAM block to a corresponding 
configurable logic block; and 65 

a control circuit for transferring values between the array 
of configurable logic blocks and the array of RAM 
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blocks, wherein the control circuit is configured to 
route a set of configuration data values from the RAM 
blocks to the configurable logic blocks on the configu- 
ration buses over a plurality of cycles. 

27. The FPGA of claim 26, wherein each RAM block 
comprises: 

a configuration data cache memory for storing configu- 
ration data values for configuring a configurable logic 
block; and 

a state data cache memory for storing state data values 
associated with the configurable logic block. 

28. The FPGA of claim 27, wherein each RAM block 
further comprises a memory dedicated to storing state data 
values. 

29. The FPGA of claim 26, wherein each RAM block 
further comprises a plurality of local cache memories, 
wherein each local cache memory comprises a configuration 
data cache memory for storing configuration data values for 
configuring a configurable logic block, and a state data cache 
memory for storing state data values associated with the 
configurable logic block. 

30. Afield programmable gate array (FPGA) comprising: 
an array of configurable logic blocks; 

an array of random access memory (RAM) blocks for 
storing one or more sets of configuration data values for 
configuring the configurable logic blocks, wherein each 
RAM block has a corresponding configurable logic 
block; and 

a plurality of configuration buses, wherein each configu- 
ration bus couples a RAM block to a corresponding 
configurable logic block, wherein the configuration 
data values are routed from the RAM blocks to the 
configurable logic blocks on the configuration buses 
over a plurality of cycles, wherein the configuration 
buses are bi-directional, such that the configuration 
data values are also routed from the configurable logic 
blocks to the RAM blocks on the configuration buses. 

31. A field programmable gate array (FPGA) comprising: 

an array of configurable logic blocks; 

an array of random access memory (RAM) blocks for 
storing one or more sets of configuration data values for 
configuring the configurable logic blocks, wherein each 
RAM block has a corresponding configurable logic 
block; 

a plurality of configuration buses, wherein each configu- 
ration bus couples a RAM block to a corresponding 
configurable logic block, wherein the configuration 
data values are routed from the RAM blocks to the 
configurable logic blocks on the configuration buses 
over a plurality of cycles; 

a plurality of random access buses, wherein each random 
access bus couples a RAM block to a corresponding 
configurable logic block, each of the random access 
buses comprising an address bus for providing an 
address from the configurable logic block to the RAM 
block, a first data bus for providing data from the 
configurable logic block to the RAM block, and a 
second data bus for providing data from the RAM 
block to the configurable logic block; and 

circuitry for accessing the RAM blocks as random access 
memory through the configurable logic blocks using 
the random access buses. 

32. The FPGA of claim 31, further comprising an expan- 
sion bus which is programmable to couple adjacent RAM 
blocks. 
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33. A field programmable gate array (FPGA) comprising: 

an array of configurable logic blocks; 

an array of random access memory (RAM) blocks for 
storing one or more sets of configuration data values for 
configuring the configurable logic blocks, wherein each 5 
RAM block has a corresponding configurable logic 
block, and wherein each RAM block comprises: 
a configuration data cache memory for storing configu- 
ration data values for configuring a configurable 
logic block; and 
a state data cache memory for storing state data values 
associated with the configurable logic block; 

a plurality of configuration buses, wherein each configu- 
ration bus couples a RAM block to a corresponding J5 
configurable logic block, wherein the configuration 
data values are routed from the RAM blocks to the 
configurable logic blocks on the configuration buses 
over a plurality of cycles; and 

means for independently transferring state data values 20 
between the state data cache memory and the config- 
urable logic block. 

34. A field programmable gate array (FPGA) comprising: 

an array of configurable logic blocks; 

an array of random access memory (RAM) blocks; and 
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a plurality of random access buses, wherein each random 
access bus is programmable to couple a RAM block to 
a corresponding configurable logic block, each of the 
random access buses comprising an address bus for 
providing an address from the configurable logic block 
to the RAM block, a first data bus for providing data 
from the configurable logic block to the RAM block, 
and a second data bus for providing data from the RAM 
block to the configurable logic block; and 

circuitry for accessing the RAM blocks as random access 
memory through the configurable logic blocks using 
the first buses. 

35. The FPGA of claim 34, further comprising an expan- 
sion bus which is programmable to couple adjacent RAM 
blocks. 

36. The FPGA of claim 35, wherein the expansion bus 
comprises a first expansion bus which selectively couples 
adjacent RAM blocks along a first direction, and a second 
expansion bus which selectively couples adjacent RAM 
blocks along a second direction. 

37. The FPGA of claim 34, further comprising a commu- 
nications port providing an interface between the RAM 
blocks and circuitry external to the FPGA. 



